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Abstract 

We define gradient networks as directed grapiis formed by local gradients of a scalar field distributed 
on the nodes of a substrate network G. We derive an exact expression for the in-degree distribution of the 
gradient network when the substrate is a binomial (Erdos-Renyi) random graph, G N,p- Using this expression 
we show that the in-degree distribution R{1) of gradient graphs on Gn^p obeys the power law R{1) oc 
for arbitrary, i.i.d. random scalar fields. We then relate gradient graphs to congestion tendency in network 
flows and show that while random graphs become maximally congested in the large network size limit, 
scale-free networks are not, forming fairly efficient substrates for transport. Combining this with other 
constraints, such as uniform edge cost, we obtain a plausible argument in form of a selection principle, for 
why a number of spontaneously evolved massive networks are scale-free. This paper also presents detailed 
derivations of the results reported in Ref. jQU. 

PACS numbers: 89.75.Fb, 89.75.Hc, 89.20.Hh, 89.75.Da 
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I. INTRODUCTION 
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It has recently been recognized ||2|, |3|, |4j] that a large number of systems are organized into 
structures best described by complex networks, or massive graphs. Many of these networks, also 



called scale-free networks, such as citation networks [5], the www la], the internet ||7|], cellular 
metabolic networks ifslB], the sex- web lioi . the world-wide airport network Ill[[l2i . and alliance 
networks in the U.S. biotechnology industry fl^, possess power-law degree distribution, P{k) ~ 
Scale-free networks are very different from pure random graphs, which are well studied 
in the mathematical literature lisl . and which have "bell curve" Poisson degree distributions. 
Therefore, it is natural to ask: Why do scale-free networks emerge in nature? 

The diverse range of systems for which scale-free networks are important suggests that perhaps 
there is a simple common reason for their development. Generally, real-world networks do not 
form or evolve simply by purely random processes. Instead, networks develop in order to fulfill a 
main function. Often that function is to transport such as information, cars, power, water, 

forces, etc. It is thus plausible that the structure that the network develops (scale-free in particular) 
will be one that ensures efficient transport. Recent studies that explore the connection between 
network topology and flow efficiency were done by a number of researchers, including Valverde 
and Sole [ 16] in the context of the internet, by Valverde Ferrer Cancho and Sole iIITII in the context 
of software architecture graphs, and Guimera et.al. in the context of the world-wide airport 
network. All these studies clearly show that the flow optimization dynamics which attempts to 
maintain the overall efficiency, will induce strong constraints on the structural evolution of the 
network. 

In this paper, we investigate the processing efficiency of flows on networks when the flows 
are generated by gradients of a scalar field distributed on the nodes of a network. This approach 
is motivated by the idea that transport processes are often driven by local gradients of a scalar. 
Examples include electric current which is driven by a gradient of electric potential, and heat flow 
which is driven by a gradient of temperature. The existence of gradients has been also shown to 
play an important role in biological transport processes, such as cell migration llsl : chemotaxis, 
haptotaxis, and galvanotaxis (the later was shown to play a crucial role in morphogenesis). 

Naturally, the same mechanism will generate flows in complex networks as well. Besides the 
obvious examples of traffic flows, power distribution on the grid, and waterways, we present two 
less known examples of systems where gradient-induced transport on complex networks plays 
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an important role: 1) Diffusive load balancing schemes used in distributed computation ligj (and 
also employed in packet routing on the internet), and 2) Reinforcement learning on social networks 
with competitive dynamics ll20ll . In the first example, a computer (or a router) asks its neighbors 
on the network for their current job load (or packet load), and then the router balances its load 
with the neighbor that has the minimum number of jobs to run (or packets to route). In this case 
the scalar at each node is the negative of the number of jobs at that node, and the flow occurs in the 
direction of the gradient of this scalar in the node's network neighborhood. In the second example, 
a number of agents/players who are all part of a social network, compete in an iterated game based 
with limited information [20]. At every step of the game each agent has to decide who's advice 
to follow before taking an action (such as placing a bet), in its circle of acquaintances (network 
neighborhood). Typically, an agent will try to follow that neighbor which in the past proved to be 
the most reliable. That neighbor is recognized using a reinforcement learning mechanism: a score 
is kept for every agent measuring its past success at predicting the correct outcome of the game, 
and then each agent follows the advice of the agent in its network neighborhood which has the 
highest score ll2flll accumulated up to that point in time. In this case, the scalar is the past success 
score kept for each agent. 

The remainder of this paper is organized as follows. In Section II we systematically build a 
framework for analyzing the properties of gradient flows on networks, which, as it will be demon- 
strated, generically organizes itself into a directed network structure without loops. In Section III, 
we obtain the exact expression for the in-degree distribution of the gradient flow network on bino- 
mial random graphs and show that in a certain scaling limit the gradient flow network becomes a 
scale-free network. We also discuss possible connections of this result to sampling biases in trace- 
route measurements that have been used to infer the topology of the internet. Finally, in Section 
IV we study how the structure of a network affects the efficiency of its transport properties, and 
offer a possible explanation in the form of a selection principle for the emergence of real-world 
scale-free networks. 



II. DEFINITION OF A GRADIENT NETWORK 

Let us consider that transport takes place on a fixed network G = G{V, E) which we will 
call in the remainder, the substrate graph. It has N nodes, V = {0, 1, — 1} and the set 
of edges E is specified by the adjacency matrix A = {aij} {aij = 1 if z and j are connected. 
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ttij = otherwise, and an = 0). Given a node i, we will denote its set of neighbors in G by 
Sl^^ = {j E V \ ttij = 1}. Let us also consider a scalar field (which could just as well be called 
'potential landscape') h = {Hq, .., h^-i} defined on the set of nodes V, so that every node i has a 
scalar value hi associated to it. 

We define the gradient Vhi of the field h in the node i to be the directed edge Vhi = (i, 
which points from i to that neighbor, G S^^^ U {i} on G at which the scalar field has the 
maximum value in S^^^ U {i}, i.e.: 

^{i) = argmax (hj), (1) 
i65f)u{i} 

see Fig. [H According to its classical definition, a gradient vector points in the direction of the 
steepest ascent at a point on a continuous (c/-dimensional) landscape. The above definition is a 
natural generalization to the case when the continuous landscape is replaced by a graph. 

Note that = i,ifi has the largest scalar value in its neighborhood (i.e. in the set S^^^ U {i}), 
and in this case the gradient edge is a self-loop at that node. Since h always has a global maximum, 
there is always at least one self-loop. It is possible that Eq. ([T]) has more than one solution (several 
equal maxima) in the case of which we say that the scalar field is degenerate. In this paper we deal 
only with non-degenerate fields, which is typical when for example h is a continuous stochastic 
variable. 







— — -;;^d 



FIG. 1 : Definition of a gradient on a network. The gradient at node i is a directed edge pointing towards the 
maximum value of the scalar (node /x) in the node's neighborhood. 
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This allows us to define: the set F of gradient edges on G, together with the vertex set V form 
the Gradient Network, VG = WG{V, F). 

Assuming that all edges have the same 'conductance', or transport properties, the gradient 
network will be the substructure of the original network which at a given instant will channel the 
bulk of the flow, and thus alternatively can be called as the maximum flow subgraph. 

In general, the scalar field will be evolving in time, due to the gradients generated currents, 
and also to possible external sources and sinks on the network (for example packets are generated 
and used up at nodes, but they can also be lost). As a result, the gradient network VG will be 
time-dependent, highly dynamic. 

A. Some general properties of Gradient Networks 

Here we will first show a number of fundamental structural properties valid for all instanta- 
neous gradient networks, and then study the degree distribution of gradient networks generated 
by stochastic scalar fields on random graphs, and scale-free networks. This will lead us to show 
that scale-free networks are more efficient substrates for transport than random graphs. The first 
important observation we make about gradient networks is: 

Non-degenerate gradient networks form forests (i.e., there are no loops in VG, and it is a union 
of trees, more exactly of in-directed, planted pines). 

To prove this statement, assume that on the contrary, there is a closed path 7 = 
{V/iii, V/ii2, V/ii„}, m > 3 made up only of directed edges from F, see Fig. |2l Let 4 be 
the node on this path for which hi^ = min{/ij^, hi^, hi^}. Node ik has exactly two neighbors 
on 7, nodes ik±i, but only one gradient direction, V/ij^ pointing away from ik. Since hi^^^^ > hi^, 
none of the neighbors ik±i will have their gradient edges pointing into i^. Since there are two 
edges, (ifc, ik~i) and (i^, ik+i), and only one gradient edge from i^, one of the edges must not be a 
gradient edge, and thus the loop is not closed, in contradiction with the assumption that 7 is a loop 
with only gradient edges. Using a similar reasoning we can show that for non-degenerate scalar 
fields, there is no continuous path in VG connecting two local maxima of the scalar field h. This 
means that on a given tree of VG there is only one local maximum of the scalar, and it is the only 
node with a self-loop on that tree. As a consequence, the number of trees in the forest equals the 
number of local maxima of the scalar fleld h on G. The fact that VG is made of trees (no loops), 
is rather advantageous for existing analytical techniques, especially if we take into consideration 
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that VG is the most important substructure driving the flow in the network. Note that unless there 
is exactly one local maximum (and thus global as well) of h on G, VG is disconnected into a 
number of trees and thus VG is not a spanning tree. Since every node has exactly one gradient 
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undirected edge 
FIG. 2: There cannot be loops in a non-degenerate VG. 



direction from it, the out-degree of every node on the gradient network is unity. It also means that 
VG has exactly nodes and edges (with at least one edge being a self-loop). However, the 
in-degree of a node i, which is the number of gradient edges pointing into i, can be anything in the 
range fcf"'' G {0,l,..,/cj}, where ki is the degree of node i onG. 

III. THE IN-DEGREE DISTRIBUTION OF A GRADIENT NETWORK ON RANDOM GRAPHS 
AND RANDOM FIELDS 



n this section we show that when the substrate graph G is a binomial random graph G = Gat ^ 
, and h is an i.i.d. (meaning independent identically distributed) random field over V, given 

/} of VG obeys the exact 



by a distribution ri{h), the in-degree distribution R{1) = Prob.{A;f"'' 
expression: 

N~l 



n=0 



(2) 



an expression independent on the particular form of the distribution for the scalars, r]{h). The 
binomial random graph (also coined in the physics literature as the Erdos-Renyi random graph) 
is constructed by taking all pairs of N nodes and connecting them with probability p, inde- 
pendently from other links. Figure |3l shows the agreement between numerical simulations and the 
above exact expression. We will also show, that the gradient network VG becomes a scale-free 
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FIG. 3: Comparison between the exact formula ^ and numerics. Here N 
The numerical values are obtained after averaging over 10^ sample runs. 
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1000, p = 0.1, (z = 100). 



(or power law) network with respect to the in-degree distribution, in the scaling limit N ^ oo and 
p 0, such that Np = z = const. ^ 1. The in-degree distribution in this limit is described by 
the law: 

R{1) ~ ^ , < / < z . (3) 



a behavior which is also apparent from Fig. |3l This power-law is a rather surprising result, since the 
substrate graph is a random graph which is not scale-free, its degree distribution (in the same limit) 
being Poisson, with a well defined average degree z (setting the scale) and faster than exponential 
decaying tails lisi . 

A similar finding was reported in by Lakhina et.al. by repeating the trace-route measure- 
ments employed to sample the structure of the internet, on binomial random graphs. Lakhina et. 
al. find that the spanning trees obtained this way have a degree distribution that obeys the 1/k 
law. Later, in Ref. ll22ll . Clauset and Moore have presented an analytical approach to derive the 
1/k law. This suggests that perhaps there could be a mapping between the trace-route sampling 
generated graphs and gradient networks. Although it is not an exact mapping, a close connec- 
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tion can indeed be made, and this will form the subject of a forthcoming publication. The main 
warning sign coming from the trace-route observations is then the fact that trace-route sampling 
might not be the best way to measure the structure of the internet, given that on random binomial 
graphs it fails miserably to reproduce its degree distribution (instead of a Poisson, it gives a 
law). However, all is not lost, for the following reason: trace-route measurements of the internet 
suggested a power-law dependence for its degree distribution, \/kP' with an exponent of 7 taking 
values between 2 and 3, which is definitely not close to unity! This excludes the binomial random 
graph as a model for the internet. One might then wonder for what kind of graphs will trace-route 
measurements suggest a power-law dependence with an exponent 7 > 2? In Section III, we make 
the observation that if the substrate graph is a scale-free network with degree distribution given 
by l/fc''' (7 > 2) then the corresponding gradient network will also be a scale-free network with 
the same exponent 7. Using the above mentioned close mapping between trace-route trees and 
gradient networks (namely, trace-route trees can be interpreted as suitably constructed gradient 
networks) this suggests that at least the assumption that the internet is a power-law graph with 
exponent 7 > 2 is consistent with trace-route measurements. Certainly, the problem of sampling 
biases generated by trace-routes could be elucidated by answering the following question: are 
there non-power-law substrate graphs which would still generate scale-free trace-route trees with 
exponent 7 > 2? This is an open question wordy of further investigation. 

IV. DERIVATION OF THE EXACT EXPRESSION 

In this section we give a combinatorial derivation for formula Q. A more analytic and standard 
approach can be found in the Appendix, which was our original method, and it has inspired the 
combinatorial one presented below. 

In order to calculate the in-degree distribution R{1) this way, we first distribute the scalars on 
the node set V , then find those link configurations which contribute to R{1) when building the 
random graph G over these nodes. 

Without restricting the generality we will calculate the distribution of in-links for node 0. Let 
us consider a set of n nodes from V , that does not contain node 0, and it has the property that 
the scalar values at these nodes are all larger than Iiq. We will denote this set by {r}„. The 
complementary set of {t}„ mV \ {0} will be denoted by see Fig. HI 

In order to have exactly I nodes pointing their gradient edges into node 0, we must fulfill the 
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following conditions: first, they have to be connected to node and, second, they must not be 
connected to the set {r}n (otherwise, they would be connected to a node with a scalar value larger 
than ho, according to the definition of {r}„). The probability for one node to fulfill these two 
conditions is p(l — p)", and since the links are drawn independently, for / nodes this probability 
is [p{l — p)"]'. We must also require that no other nodes will have their gradient links pointing 
into node 0. Obviously, by definition, nodes from {r}„ will not be pointing gradients into node 0. 
Therefore, we have to make sure, that none of the remaining N — 1 — I — n nodes from C{^}^ will 
be pointing into 0. For one such node this will happen with probability 1 — p{l — p)". For all the 
N — 1 — I — n such nodes this probability will be [1 — p(l — p)"]^^ ' Thus, given a specific 
set {rjn, the probability of exactly / in-links to node is: 



The combinatorial factor in Q counts the number of ways the set of / nodes which point their 
gradient edges to node 0, can be chosen from C{^}^. 



The probability in ^ was computed by fixing ho and the set {r}„. Next, we compute the 
probability Qn of such an event for a given n, while letting the field h vary according to its 
distribution. The probability for a node to have its scalar value larger than ho is: 




(4) 




N.p 



FIG. 4: Schematic of the construction given in the main text. 




(5) 
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The probability to have exactly n nodes with this property is given by: 

[^{hoT [1 - ^iho)f-'-'' . (6) 

The number of ways the n nodes can be chosen from V \ {0} is just the binomial (^^^) ■ Thus, 
the total probability Qn will be given by: 

Qn = (^^ ^) J dho riiho) [i{ho)r [1 - l{ho)f-'-'' = ^ , (7) 

where the last equality in the above is obtained after performing the change of variables du = 

d'j{ho) = dho r]{ho). 

As a final step, by combining Q with ©, and summing over all possible n values, we arrive 
at 0. 

A. Derivation of the 1 / / scaling for the in-degree distribution 

In order to obtain the scaling l/l valid in the limit N ^ oo, p ^ 0, such that z = pN = 
const. ^ 1, for < / < 2;, we will use the saddle point method. We write equation Q first in the 
form R{1) = jj: J2n=i ^N^n, I) and then exponentiate the argument. Using the Stirling's formula 
to the first order (In (x!) ~ a;(lnx — 1)), one obtains that r^in, I) ^ eiN(n,i)^ where: 

gjv(n, I) = {N- n) In [{N - n) /e] - l\n{l/e) - {N - n - I) In [{N - n - I) /e] 

+ {N -n-l)\n{l-p{l- p)"-^) + l[\np + (n - l)ln{l - p)] (8) 

To calculate the largest contributor under the sum in © we use the saddle point method: 

J dxe^''^'^ ~ \/2Txe^^^°^ / a/— /"(xq) where f'{xo) = 0. In our case we thus need to consider: 



= (9) 

n*{l) 



dn 

where n*{l) denotes the index of the maximal contributor for a given /. The difficulty we get into 
by trying to find n*{l) from ® is that the equation cannot be solved explicitly for n*{l). To get 
around this, let us consider instead the derivative: 

dqN{n,l) 



= (10) 

i(n) 



dl 

defining l{n). Performing the derivation the solution is easily found as: 

/(n) = (A^ - n)p(l - pY~^ . (11) 
10 



Since i{n) is a monotonic function of n, it is invertible. (i'{n) < 0). The inverse of dTTT) . will be 
denoted by n{l). This means that: 

dqN{n,l) 



dl 



= . (12) 

HI) 



Next, we observe that i(n) satisfies ^ when inserting it into its explicit expression. Accordingly, 
it will also be satisfied by h{l). Assuming that there is only one solution to ® it thus follows that: 

n*{l) = nil) . (13) 

If we now calculate gAr(n, /), at the saddle point, we find that q]\f(n*{l), I) = (using the fact that 
the parametric curve of the maximum can be written as either (n* (l) , /) or (n, l{n)) and thus calcu- 
lating gAr(n, /(n))). This means that we need to go one step further in the Stirling series, in order 
to calculate the leading piece of e'"^^^*^"^'') at the saddle point. For the saddle point itself, we use 
the same expression as previously (obtained with the first order Stirling approximation) because 
as it can be shown, the corrections introduced by the next term in the Stirling approximation are 
vanishing as ^ oo and therefore they will be neglected. Thus using the next order term in as 
well in the Stirling series (In (x!) x(ln x — 1) — In ( y^) + In ( v^27r)) and writing 

InrNinJ) ^ qN{n,l) + SN{n,l) , (14) 

where SAr(n, /) is the correction generated this way, we obtain: 



Calculating the second derivative d'^qN{n,l)/ dn^ at the point (fTTT) . one finally obtains: 

Combining (fT^ with ([T?b . ( fT5t in the saddle point formula, one obtains that: 

R{1) ^ ^, (17) 

valid in the domain I < I < Ic- The cutoff value l^. is determined by the validity range of the 
saddle-point method: since the function n*{l) is monotonically decreasing, at / = it will hit the 
lowest allowed value by the range of the integral (or sum), namely, at = 1. Since / is the 

inverse function of n*, it follows that 

Ic = i{n*{lc)) = /(I) =p{N-l)=z (18) 
11 



meaning that the cutoff for the l/l scaling law happens at z, which is indeed confirmed by the 
numerical simulations shown in Fig. |3l 

V. SCALE-FREE NETWORKS : RESULTS OF A SELECTION MECHANISM? 

If the substrate graph is a scale-free network (here we used the Barabasi-Albert (BA) process 
with parameter m to generate the scale-free network lO], but others will lead to similar conclusions 
as long as 7 > 2), the gradient graph will still be a power-law. Here m is the number of "stubs" of 
an incoming node which will attach preferentially to the existing network in the growth process. 
Figure 121 shows the in-degree distribution of the corresponding gradient networks (for m = 1 and 
m = 3, lines-markers) which are to be compared with the degree distribution of the substrate 
network itself (lines). One immediate conclusion is that the gradient network is the same type 
of structure as the substrate in this case, i.e., it is a scale-free (power law) graph with the same 
exponent! 



10° r 




k I 

FIG. 5: The degree distributions of the gradient network (R{1)) and the substrate (P{k)), when the substrate 
is a BA scale-free graph with parameter m (m = 1, 3). Here = 10^, and the average was performed over 
10^ samples. 

If A^^'-*"'' denotes the number of nodes with / in-links, the total number of nodes receiving gradi- 
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ent flow wiU be Nreceive = J2i>i Ni . The total number of gradient edges generated (total flow) 
is simply Nsend = N because every node has exactly one out-link. 

If the flow received by a node has to be processed, it will happen at a finite rate. For example, 
a node receiving a packet, has to read off its destination and find out to which neighbor to send 
it. This is a physical process and takes a non-zero amount of time. Thus, if a node receives too 
many packets per unit time, they will form a queue, and long queues will generally cause delays 
in information transmission and thus leads to congestion, or jamming. An important question then 
arises: Can the topology of the underlying substrate graph influence the level of congestion in the 
network? 

The answer is yes, as illustrated through the following trivial examples, a) If the network was a 
star-like structure as in Fig. |6^), then obviously all pairs of nodes would be at most two hops from 
each other, which is advantageous from the point of view of shortest distance between sources 
and destinations (and also routing would be very simple), however it would not work for large 
networks, because the central node would have to handle all the traffic from the other nodes and 
would have to process an extremely large queue, b) On the contrary, if the network would have 
a ring-like structure as in|6j5), then in average there would be one server per one client, a rather 
advantageous setup from the point of view of having no congestion, however, there would be no 
short distances for transmission. More importantly, for both structures in Fig. |6l the network is 
rather vulnerable: in case of a) the failure of the central node, and in the case of b) the failure of 
any node, would cause a complete breakdown of transmission. 



a) * 
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• 

» • 


• 
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• 

» 



b) 



FIG. 6: Illustrating the influence of structure on flows. In a) the central node will have maximal congestion, 
while in b) there is no congestion. 

To characterize this interdependence between structure and flow in more generality, we intro- 
duce the ratio Nreceive/ N send, which, naturally, will be related to the instantaneous global con- 
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gestion in the network. As explained above, if this ratio is small, then there will be only a few 
nodes {Nreceive) processing the flow of many {Ngend) others and therefore, long queues are bound 
to occur, leading to congestion. 

If we assume that flows are generated by gradients in the network, we can define: 

J = 1 - //^^££!!£\ \ = i?(o) (19) 

\ \ J-^send I hi nw 

as the congestion {or jamming) factor. Certainly, J = 1 means maximal congestion and J = 
corresponds to no congestion, and we always have J G [0, 1]. Note that J is rather a congestion 
pressure characteristic generated by gradients, than an actual throughput characteristic. 

For the random graph substrate Gat^p , J^{N,p) = ^ J2n=i ~ Pi^ ~ We can 

show that in the limit p = const, and N ^ oo, J^{N,p) = 1 T^T + C> (^)] 1, 

i.e., the random graph becomes maximally congested. When z = Np is kept constant while 

^ oo, a good approximation is J^{z) = MmN^oo J ^{N, z/N) = Jq dx e"^*^^"^)"^ . The 
J^{z) function has a minimum at z* = 2.8763..., when J^{z*) = 0.6295.. and J^(0) = 1. Since 
gzxe |g always bounded: e^^^ G [l, e^^^~\ , for all x > and z > 0, J^{z) > dx e~^*^ '''' = 
i [Ei{-z) - Ei {-ze^^)]. Expanding for z » 1, we obtain: .J^{z) > 1 - ^ - ... 1, 
(C is the Euler constant) i.e., the random graph asymptotically becomes maximally congested, 
or jammed. The latter result (up to corrections in 1/z) can immediately be obtained if one uses 
directly the asymptotic form (fTTb and the fact that R{0) ^ 1 ~ J-^ dlR{l). 

For scale-free networks, however, the conclusion about jamming is drastically different. We 
find that the jamming (pressure) coefficient J is always a constant, independent of N, in other 
words, scale-free networks are not prone to maximal jammingl In particular, J^^{N, m = 1) = 
0.6839... and J^^{N, m = 3) = 0.6955... Figure IT] shows as comparison the congestion factors as 
function of network size both for random graph and scale-free network substrates. 

Most real-world networks evolve more-or-less spontaneously (like the internet or www) and 
they also can reach massive proportions (order 10* nodes). At such proportions, pure random 
graph structures would generate maximal congestion pressure (practically equal to unity) in the 
network and thus such substrates would be very inefficient for transport. Scale-free networks, 
however, have a congestion pressure which is a constant bounded well away from unity, and thus 
they are rather efficient substrates for transport. So why not all real-world networks are scale- 
free? Our analysis assumed that all edges have the same transport properties (conductance, or 
"cost"), which is true for some networks like the www, the internet, and not true for others: power 
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FIG. 7: The congestion coefficient for random graphs (o, p = 0.05; o,p = 0.1) and scale-free networks (+, 
m = 1; □, m = 3). 

grid, social networks, etc. When there are weights on the edges (conductance or cost) the actual 
transport efficiency (determined by actual throughput) will strongly depend on those and therefore 
also select the network topology. Note that the congestion pressure depends on local properties 
of network topology (2-step neighborhoods). Thus we expect that all networks with similar 2-step 
neighborhood distributions would have similar gradient networks, not just the models studied here. 

In summary, after introducing the concept of gradient flow networks, we have shown why 
certain complex networks might emerge to be scale-free. We do not give a specific mechanism 
for network evolution, which we believe to be network and process dependent, and therefore 
not universal. Instead, the network evolution mechanism is selected such, that the transport (the 
network's main function) is efficient, while a number of constraints imposed by the specific nature 
of the network (edge cost, conductance, etc.) are obeyed. 

In the case of the internet, if a router is constantly jammed, engineers will resolve the problem 
by connecting to the network other routers in its network vicinity, to share the load. This is a 
tendency to optimize the network flow locally. A series of such local optimization processes will 
necessarily have to constrain the global structure of the network. It seems that scale-free networks 
are within the class of networks obeying this type of constraint. 
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APPENDIX A: AN ANALYTIC DERIVATION OF THE IN-DEGREE DISTRIBUTION 

When calculating the degree distribution, we have to perform two averages: one corresponding 
to the scalar field disorder 

{•)h = j dho...dhN-i ri{ho)..r]{hN_i)» , (Al) 
and the other to an average over the network (graph ensemble): 

{•)nw = ^--- ^ v{aoi)...v{aN-2N-i)» , (A2) 

aoi ajv-2]V-i 

where v{a) = — pY'"', a G {0, 1} and J2a = Z]a=o- ^^^^ ^ binomial random graph 
Gn,p with nodes and link-probability p. The integrals in (lAll) are computed over the range of 
the scalar field and the summation in (IA2I) is over all A^(A^ — l)/2 pairs (i, j) with i < j . 

In order to calculate the in-degree distribution, we define first a counter operator for the in- 
links. Without restricting the generality we calculate the in-degree of the gradient network for 
node namely, /cq"''. Let us introduce B = I + A, where I is the N x N identity matrix so 
bij = 6ij + ttij, and the quantities: Hi{j) = 1 — bij + bij 6{ho — hj) for i,j G V, and i G Sq^\ 
Thus, the in-link counter can be written as: 

N-l 

4^"^=E«o.n^^(^')- (A3) 

i=i j=i 

With the aid of Fig. [Hwe see that indeed this expression will count the number of gradient edges 
into node 0: Hi{j) is zero only if the neighbor j of i (except node 0) has a larger scalar value than 
node 0, i.e., < hj, otherwise Hi(j) is equal to unity. Therefore a term under the sum in (IA3I) 
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FIG. 8: Node has a gradient edge from i, if its scalar value is larger than the scalars of all its neighbors 

will be non-zero if and only if for all neighbors j of i (i.e., bij = 1) hj < ho holds, making the 
edge (i, 0) to be the gradient edge for node i. 

The probability that a node will have / in-degree on the gradient network F, is: 

7r 



'=0 I hi nw / ZTT \\ I hi nw 



First, we compute the average over the scalar field. (The order of the averages does not matter, 
however, it is formally easier this way.) Let us denote 

^G(g) = (e"'"''"'), (A5) 

We have: 

Lcisi) = j dho... j dfiM-i r/(/io)..r7(/i^_i)e-''?S-^''^«nf."i'[^-''^+''^ eiho-h,)] ^^^^ 
Let M,{m) = YlT=iH,{j). So 

Lciq) = j dho...j dh^-i r7(/io)..r7(/i;v-i)e-''^^""''«''^(^-^) . (A7) 
Using the recursion 

M,(m) = [1 - hm + hm eih - hm)] M,{m - 1) , (A8) 
the integral over Iin-i can be performed: 

Lciq) = jdho... J dhN-2 'n{ho)-vihN-2) 

X |7(/io)e-''?S""'^o»^-^»(^-2) + [1 - ^(/,o)]e-'''St-^^ao4i-f.iv-i]M,(7v-2)| ^^^^ 
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where 7(x) = dy ri{y). Performing all the integrals recursively, except for h^, we obtain: 

Where J(iV,r2) = /(iM(M[7(M]'^"'"'11 - 7(M]"- Here [r],„ = {r(l), r(2), r^} 
is a n-subset of the set {1,2,...,A^ — 1} and Vn{N — 1) denotes the set of all n-subsets of 
{l,2,...,iV - 1}. We have \Vn{N - 1)| = (^^^). After a change of variables u = j{ho) 
and using du = d'j{ho) = r](ho)ho the integral J(N,n) yields J(N,n) = ;^(^^^) ^, i-C-, the 
in-degree distribution is independent on the choice of the r](h) distribution! 

In the following, we perform the network average {LG{q))nw- ^ fixed n-subset [r]„, let us 
denote: 

= (e-'''^-i'"0"n;=i(i-fe..o))\ . (AlO) 

\ / nw 

Thus, 

1 /AT - 

(Mg)U = ^E , E ^irlM- (All) 

Let 

n 

T„ = [r]„ U U (A12) 

be the set of vertices [r]„ and its neighbors on G. 

Note, that nj=i(l ~ ^ir{j)) = 1 if and only if i ^ T„ otherwise it is zero. Therefore, 

N-\ n 

aoi ~ ^irO)) = the nr. of neighbors of which do not belong to T„. (A 13) 

i=\ j=i 

From (1Mb 

7V-1 

^H.(?) = E- E ^Ki)...^(a^-2iv-i) J]e-'^'^«n;U(i-^>,.o)) (a14) 

101 1JV-2JV-1 *=1 

Since r(j) 7^ 0, ([r]„ G Vn{N — 1)), the sums over the matrix variables a^i can be performed: 

^t;(aoi)e-'^"0'n;=i(i-fc.ro)) = 1 _ ^ + pg^i^n-^ia-M.)) , (A15) 

and therefore 

N-l 



■''iT{j)) 



(A16) 



ai2 ajv-2JV-i «=1 
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The set of vertices {1, 2, ...,N ~ 1} is split into two groups: [r]„ and its complementary in 
{1,2,...,A^ — 1}. Without changing anything, we can rename the vertices, such that {1, 2, n} = 
[r]„ and C[r]„ = {n + l,n + 2, N — 1} he the complementary set of [r]„. It is easy to see that 
only cross-terms (ajj involving one node i from [r]„ and one node j from C[^]„) give non-trivial 
contribution (i.e., different from unity) in (IA16I ). Thus: 

N-l 

i=n+l an ani 

Let tti = 1 — p and f3i = p. Then: 

J2 v{au) [ai + /?,e-'^(^-"")-(^~'^"')] =02 + /?2e-'''(^~"^')-(^-"'") (A18) 

an 

where 02 = {l—p)ai+p and P2 = (1 — The summation over the rest of the matrix elements 
can be similarly performed to give (for a fixed node i) 

an+i + Pn+ie-'" . (A19) 
The coefficients are determined from the recursion: 

ttfc = (1 -P)"fc-i +P , ai = l-p 
= (1 -p)/9fc_i , (3i=p 
which obeys ak + Pk = for all k. These recursions are easily solved: 

an+i = l-p{l-p)\ (3n+i=p{l-pT . {All) 

Thus (IA19I) becomes 1 — p(l — p)" + p{l — p)"e~''. Since for all indices i in ( IA17I ) the result of 
the summations is the same, one finally obtains: 

Ziruiq) = [1 -Ml -P)" (1 - e-^^)]"^-^-" . (A22) 

Because the result in (IA22I) is not specific of the [r]„ set, for all realizations of [r]„, Zy^.^^ (q) is the 
same expression, and thus the sum over all realizations of [r]„ in (lAl II) will generate the factor 
\Vn{N — 1)1 = (^^^) which cancels the combinatorial factor in (lAl II) . Thus: {Lc{q))nw ~ 
T! J2n=o Z[t]Aq)- Plugging this into (IA4I) . and performing the integral over the q variable we 
obtain: 

^(0 = ^ E " " [1 - - PTf [P(l - (A23) 

n=0 ^ ^ 

with the usual convention (^) = for M < m. Equation (IA23I) is the exact expression for the 
in-degree distribution of the gradient network VG. 
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